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Features 




A COMPLETE FLOATING POINT ARITHMETIC 
SOLUTION FOR HIGH-SPEED PROCESSORS 
AND COPROCESSORS 


FULL 32-BIT AND 64-BIT FLOATING POINT 
FORMAT AND OPERATIONS, CONFORMING TO 
THE IEEE STANDARD FOR FLOATING POINT 
ARITHMETIC 


INTERFACE CHIPS TO POPULAR 32-BIT 
MICROPROCESSORS AVAILABLE 


FULL FUNCTION 

Addition 

Subtraction 

Multiplication 

Division 

Conversion to and from 32-bit two’s complement 

integers 

Absolute value 

Compare 


INTERFACES DIRECTLY TO 32-BIT BUSES 


SINGLE PHASE, EDGE-TRIGGERED CLOCKED 
INTERFACE WITH FULLY REGISTERED INPUTS 
AND OUTPUTS 


HIGH SPEED 

2.38 MFlops (420 ns) 32-bit add/subtract/convert 
and compare 

1.85 MFlops (540 ns) 64-bit add/subtract/convert 
and compare 

2.38 MFlops (420 ns) 32-bit multiply 
1.67 MFlops (600 ns) 64-bit multiply 
0.52 MFlops (1.92 ns) 32-bit divide 
0.26 MFlops (3.78 jjls) 64-bit divide 
Up to 3.33 MFlops (300 ns) for pipelined operations 
Up to 3.33 MFlops (300 ns) for chained operations 
32-bit data input or 32-bit data output operation 
every 60 ns 

STANDARD 64-PIN DIP AND 68-PIN PGA 
PACKAGES 


Description 

The WTL 1164 floating point multiplier and the WTL 
1165 floating point arithmetic logic (ALU) unit are 
designed to provide high-speed 32-bit and 64-bit 
floating point processing capability. By virtue of their 
high performance, simple interface and ease of 
control, the WTL 1164/1165 are ideal choices for high 
performance floating point coprocessors. In fact, the 
bus interface and control of the WTL 1164/1165 are 
so simple that all of the logic required to implement a 
memory-mapped coprocessor to industry standard 
microprocessors can be put on a single chip. A short 
application note explaining how a single chip, the WTL 
1163, can interface the WTL 1164/1165 to Intel’s 
80386 is attached. 

This two-chip set can handle both single (32-bit) and 
double (64-bit) precision IEEE formats and 
operations, as well as full 32-bit two’s complement 
integers. The NMOS VLSI design allows all of the 
functional elements to be combined on two chips while 
very flexible input/output path and control signals 
permit the WTL 1164/1165 to work in a broad range 
of bus systems and a wide variety of applications. 

The data format and floating point operations conform 
to the requirements of the IEEE Standard for Binary 


Floating Point, including all rounding modes, infinity, 
NaN, denormalized and zero operand representations, 
and the treatment of exceptions, such as overflow, 
underflow, divide-by-zero, invalid and inexact 
operations. This assures complete software portability 
between systems designed using these parts and other 
general purpose computer systems which may be used 
to prototype algorithms and applications software. A 
“FAST” mode of operation, which removes the time 
penalty of underflow exception handling by substitution 
of zero for denormalized numbers while retaining all 
the other features, is included. 

The WTL 1164/1165 use dedicated circuit arrays to 
perform the required functions, providing significantly 
faster processing than designs which must rely solely on 
sequential, clocked logic. The array flowthrough time 
for the WTL 1164 is under 240 ns for a single 
precision (32-bit) multiply and under 360 ns for a 
double precision (64-bit) multiply. For the WTL 1165, 
the array flowthrough time is under 240 ns for both 
single and double precision functions. These figures 
include the time for performing the arithmetic function 
as well as denormalization, renormalization and 
exponent adjustments. 
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Description, continued 


Although division operations do not occur as 
frequently as addition, subtraction or multiplication 
operations, systems performing floating point 
arithmetic in close accordance with the IEEE standard 
require a floating point division operation which meets 
the requirements for accuracy specified by the 
standard. The division operation provided on the WTL 
1165 floating point ALU fully conforms with the IEEE 
standard. 

Data input and output transfers may occur at a rate of 
one transfer per clock cycle, permitting the devices to 
be used in variety of bus configurations without 
degrading performance. Two double precision inputs 
can be loaded through the 32-bit input data port, X, in 
four clock cycles, and the result unloaded in two clock 
cycles. Two single precision inputs can be loaded in 
two clock cycles and the result unloaded in a single 
clock cycle. One or both operands can be stored 
internally, allowing multiple functions or repeated 

Pin Definitions 

X 31-0 

32-bit Input/Output Port 
S 3-0 

4- bit status port which indicates execeptions or 
conditions resulting from floating point operations. 

F 4-0 

5- bit function control port 
L 3-0 

The 4-bit load control is used to specify the site and 
destination of the data on the X Port and start the 
instruction. 

U 

Unload control is used to specify the most or least 
significant half of the result latch. 

CSL- 

Load enable; load control loaded only if CSL- is a 
logic “0” 

CSUX- 

This signal controls the direction of the internal XI Bus 
and serves as the synchronous output enable for the X 


operations with a constant to be performed on two 
operands. 

All inputs and outputs are fully registered. All are 
loaded on each positive-going transition of the clock. 
Transfers from input registers to the ALU or multiplier 
array are accomplished with load controls. Unload 
controls select transfers from the array to the output 
registers and control tri-stating of the output registers. 

A mode register selects optional characteristics that are 
not often changed, such as the selection of IEEE 
format, operation timing and rounding modes. The 
16-bit mode register is loaded in 4-bit segments from 
the function input registers. 

Arithmetic functions are selected by a 6-bit function 
control, which is transferred at the same time as the 
final operand. A 4-bit status output flags arithmetic 
exceptions and conditions. 


Port. CSUX- must not be asserted when loading data 
from the X Port. 

CSUS- 

Synchronous status output enable 
OE- 

Asynchronous status and output enable; to 
activate/deactivate outputs both CSUX- and OE- must 
be a logic “0’’/logic “1” with the proper timing. Status 
outputs are similarly controlled by OE- and CSUS-. 

CLK 

System clock; all registers are synchronized to the 
positive-going edge of the clock. 

Vcc 

5V power supply for input/output port; all Vcc pins 
must be connected. 

Vdd 

5 V power supply for internal circuits; all Vdd pins must 
be connected. 

GND 

Ground; all GND pins must be connected. 
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Block Diagram 



Figure 1. WTL 1164/WTL 1165 Block Diagram 


NOTATION 

The signal names used are described by the data path 
diagram (see Figure 1). The notation following a 
signal (i.e., OE-) indicates a negative true signal. The 
notation “SIGNAL1” indicates a signal that is produced 
by a single full register from another signal, named 
“SIGNAL”, and “SIGNAL2” indicates a signal produced 
by a second level of register. A register with no clock 
signal indicated is clocked unconditionally by the CLK 
signal, while others may have gated or enabled clocks. 


In some cases, half-registers (latches) are used instead 
of registers. These are marked “LAT1” if they are 
transparent when the CLK signal is high and latching 
when CLK is low, or “LAT2” if they are transparent 
when CLK is low and latching when CLK is high. 

Individual bits in a multi-bit signal are numbered from 
zero, starting with the least significant bit 
(little-endian). 
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Block Diagram, continued 


All input signals, with the sole exception of OE-, are 
synchronized on the rising edge of CLK which, in the 
tables below, is denoted edge #0. Actions which are 
taken on the rising edge following the associated input 
signal are listed under edge #1, and actions taken after 


two rising edges are listed under edge #2. Actions 
which are taken on the falling edge following the 
associated input signal are listed under edge #0.5, and 
actions taken after two falling edges are listed under 
edge #1.5. 


Specifications 

ABSOLUTE MAXIMUM RATINGS (Above Which The Useful Life May Be Impaired) 


Supply voltage.-0.5 to 7.0 V Storage temperature range 

Input voltage.-0.5 to 5.5 V Lead temperature (10 seconds) 

Output voltage .-0.5 to 5.5 V Junction temperature. 


Operating temperature range (Tcase) -55 °C to 125 °C 


-65°C to 150°C 

. 300°C 

. 175°C 


RECOMMENDED OPERATING CONDITIONS 


PARAMETER 

1164/1165-60 

1164/1165-80 

UNIT 

COMMERCIAL 

COMMERCIAL 

MIN 

NOM 

MAX 

MIN 

NOM 

MAX 

Supply voltage, V cc 

mi 

5.0 

5.25 

mi 

5.0 

5.25 

V 

Supply voltage, V DD 

©a 

5.0 

5.25 

mi 

5.0 

5.25 

V 

Operating temperature, T CASE 

KM 


80 

KM 


80 

°C 


DC ELECTRICAL CHARACTERISTICS, l 


PARAMETER 

TEST CONDITIONS 

MIN 

MAX 

UNIT 

NOTES 

V tH 

High-level input voltage 

V DD /V CC = MIN 

2.0 


V 


V,L 

Low-level input voltage 

V DD /V CC = MIN 


0.8 

V 


^OH 

High-level output voltage 

V DD /V CC = MIN, 1 oh - -1.0 mA 

2.4 


V 


V OL 

Low-level output voltage 

V DD /V CC = MIN, 1 OL = 4.0 mA 


0.4 

V 


■u 

Input leakage current 

V DD /V CC = MAX, V IN = 0 to V cc 


10 

HA 


1 LO 

Output leakage current 
(output disabled) 

V DD /V CC = MAX, V IN = 0 or V cc 


10 

|iA 


C|N 

Input capacitance 

V cc = MAX, V IN = 0 to Vcc 


5 

PF 

Note 2 

^OUT 

Output capacitance 

V cc = MAX, V OUT =0 to Vcc 


8 

PF 

Note 2 

(-'CLK 

Clock Input capacitance 

Vcc = MAX, V| N = 0 to Vcc 


15 

PF 

Note 2 

'cc*' 

DD Supply current, WTL1164 

V DD /V cc= MAX; T cy = MIN; TTL 
inputs 


600 

mA 

Note 3 

'cc + 1 

DD Supply current, WTL1165 

v dd /v cc= MAX '- t cy = M| N; TTL 
inputs 


700 

mA 

Note 3 


Note 1: Tested from Tcase = 0°C min to 80° C max; \6d and Vcc from 4.75 to 5.25V. 
Note 2: Not tested. 

Note 3: Worst case power is at 0° C. See Figure 4 for typical power vs. temperature curve. 
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Specifications, continued 
AC ELECTRICAL CHARACTERISTICS, 1 , 4 


PARAMETER 

TEST CONDITIONS 

1164/1165-60 

1164/1165-80 

UNIT 

MIN 

MAX 

MIN 

MAX 

Tcy 

Clock cycle time 

Figure 3 

60 

DC 

80 

DC 

ns 

Tch 

Clock high time 

Figure 3 

27 

DC 

35 

DC 

ns 

Tcl 

Clock low time 

Figure 3 

27 

1000 

35 

1000 

ns 

Ts 

Input setup time 

Figure 2 

15 


15 


ns 

Th 

Input hold time 

Figure 2 

3 


2 


ns 

Tdo 

Output delay time 

Figure 2 


35 


35 

ns 

Tvo 

Output valid time 

Figure 2; Note 6 

5 


3 


ns 

Toz 

Output disable time 

Figure 2; Notes 5 and 6 

3 

35 

3 

35 

ns 

Tzo 

Output enable time 

Figure 2 

3 

35 

3 

35 

ns 

Tfl 

Flowthrough time 








WTL1164 Floating point multiplier 


240 


240 

ns 


WTL1165 Floating point ALU 


240 


240 

ns 

Tac 

Accumulation time 








WTL1164 Floating point multiplier 


120 


160 

ns 


WTL1165 Floating point ALU 


60 


80 

ns 

Tnc 

Operand encoding time WTL 1164 


60 


80 

ns 

Top 

Operation time (using Tcy min) 







WTL1164 Floating point multiplier 







32-bit multiply 



240 


240 

ns 


64-bit multiply 



360 


400 

ns 


WTL1165 Floating point ALU 







32-bit divide 



1740 


2240 

ns 


64-bit divide 



3480 


4560 

ns 


all other 32-bit functions 


240 


240 

ns 


all other 64-bit functions 


240 


240 

ns 

Tla 

Total latency (using T cy min) 







WTL1164 Floating point multiplier 







32-bit multiply 



420 


480 

ns 


64-bit minimum latency multiply 


600 


700 

ns 


WTL1165 Floating point ALU 







32-bit divide 



1920 


2480 

ns 


64-bit divide 



3780 


4900 

ns 


32-bit single operand instructions 


360 


400 

ns 


64-bit single operand instructions 


420 


480 

ns 


32-bit double operand instructions 


420 


480 

ns 


64-bit double operand instructions 


540 


640 

ns 


Note 4: All transitions except Toz are measured at 1.5V level, as shown in Figure 2, with TTL inputs of 
0.4V and 3.5V. The test load in Figure 5 is used. 

Note 5: The test load in Figure 6 is used. 

Note 6: Guaranteed but not tested. 

Specifications subject to change without notice. 
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I/O Chracteristics 



Figure 2. WTL 1164/WTL 1165 Input/Output Timing 







c 

I/O Chracteristics, continued 
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Figure 5. Normal Load Circuit for Delay 
Measurements 



Figure 6. T oz Measurement Circuit 


Method of Operation 



LOAD CONTROLS 

The CSL- and L input signals control the loading of 
operands into the on-chip registers and into the 
arithmetic path. The signals also control the loading of 
operating modes into on-chip mode registers from the 
6-bit function code bus. 

Using CSL- and L , data presented on the external bus, 
X , may be written into the on-chip registers, A and B, 
where they pass directly into the arithmetic unit. 

The CSL- signal, if de-asserted (1), causes no load 
operations to be performed, regardless of the state of 
the L signal. Load operations are performed only if the 
CSL- signal is asserted. 

LOAD NOTES 

CSL- is often used as a chip select to allow the sharing 
of the L and F fields. An inverter can be used to 
generate CSL- for the other device if the L Field 
specifies a NOP when no loading is desired. 


If Lo is set, indicating a function is to be started, the F 
Register (FREG) is loaded and an operation is 
initiated. This also resets the pipeline advance timer 
and the accumulator advance timer. 

If Lo and CSL- are both asserted while an operation is 
in progress, it will cause that operation to abort but will 
not start the new operation. The result of an aborted 
operation is undefined. 

For the conversion of integer to single or double 
precision functions, the 32-bit integer operand must be 
loaded into the A Operand Register with the LAL (#12) 
or LALF (#13) load commands. 

Divide uses the A Operand Register for intermediate 
values. The A Register must be reloaded prior to 
starating another instruction following a divide 
operation. 

Single precision loads modify both halves of the A or B 
register. 
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Method of Operation, continued 


L-3 

*-2 

Li 

L- 0 

Mnemonic 

EDGE #0.5 


0 

0 

0 

0 

LNOP 




NOP 

0 

0 

0 

1 

LF 




FI -> FREG 

0 

0 

1 

0 

LBS 

XI -> 

B 

single] 


0 

0 

1 

1 

LBSF 

XI 

B 

single] 

FI -> FREG 

0 

1 

0 

0 

LBL 

XI -> 

B 

LSW double] 


0 

1 

0 

1 

LBLF 

XI -> 

B 

LSW double] 

FI -» FREG 

0 

1 

1 

0 

LBM 

XI 

B 

MSW double] 


0 

1 

1 

1 

LBMF 

XI 

B 

MSW double] 

FI -> FREG 

1 

0 

0 

0 

LMODE 




LOAD MODE 

1 

0 

0 

1 

- reserved 





1 

0 

1 

0 

LAS 

XI 

A 

'single] 


1 

0 

1 

1 

LASF 

XI -» 

A 

single] 

FI -> FREG 

1 

1 

0 

0 

LAL 

XI 

A 

LSW double] 


1 

1 

0 

1 

LALF 

XI -» 

A 

LSW double] 

FI -» FREG 

1 

1 

1 

0 

LAM 

XI 

A 

MSW double] 


1 

1 

1 

1 

LAMF 

XI 

A 

MSW double] 

FI -> FREG 


Table 1. L Input 


UNLOAD CONTROLS 

The unload controls, U, CSUX- and CSUS-, select the 
data to be presented on the external bus, X, and the 
status bus, S. The U signal controls whether the most 
significant word or the least significant word is sent to 
the X Bus. Single precision floating point results are 
unloaded by selecting the most significant word, while 


results in integer format are unloaded by selecting the 
least significant word. 

The CSUS- and CSUX- signals control tri-stating of 
the S and X buses, respectively, in a synchronous 
fashion. The OE- signal controls tri-stating of both the 
S and X Buses and is a fully asynchronous signal. In 
the chart below, XI refers to the XI Bus shown in 
Figure 1. 


CSUS - 

CSUX- 

U 

Mnemonic 

EDGE #0.5 

EDGE #1 

0 

0 

0 

ULST 

LSW result -» XI 

STREG1 S, 

XI -» X 

0 

0 

1 

UMST 

MSW result -» XI 

STREG1 -» S, 

XI X 

0 

1 

X 

UST 

X -> XI 

STREG1 -»S, 

X tri-stated 

1 

0 

0 

UL 

LSW result XI 

S tri-stated, 

XI -» X 

1 

0 

1 

UM 

MSW result -» XI 

S tri-stated, 

XI -» X 

1 

1 

X 

UNOP 

X XI 

S tri-stated, 

X tri-stated 


Table 2. U Input 


FUNCTION CONTROLS 

The function controls select the operation to be 
performed by the floating point units. Each chip has a 
separate set of function code definitions. 

The F signal is received on each clock edge and 
buffered as the FI Bus. An operation is started when 
ordered by the Lo signal. Starting an operation loads FI 
—► F LAT2, and resets the flowthrough and accumulate 
timers. 


In the tables below, F32 denotes a single precision 
floating point operand or result; F64 denotes a double 
precision value. F32/64 represents either a single or 
double precision floating point operand. W32 and W64 
denote single and double precision wrapped, 
normalized operands and results. U32 and U64 denote 
single and double precision unrounded, wrapped, 
normalized operands and results. 132 represents a 
32-bit integer value. 
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Method of Operation, continued 

When two operands are indicated, the A operand is 
given on the left and the B operand on the right. When 
only one operand is indicated, it must be loaded into 
the A Operand Register. 

When a double precision result is indicated, operands 
may be either single precision or double precision or 
mixed single and double precision. The load controls 
indicate the precision of the operands as they are 
loaded. Mixed precision operations must be loaded in 
the specified loading sequence (figures 7, 8 and 14). 

If an instruction is started when another instruction is 
in progress, in either the multiplier or ALU, both 
instructions are corrupted. While divide operations are 
in progress a new instruction may be started by first 
starting some arbitrary instruction (killing both 
instructions), waiting three cycles, then beginning the 
third instruction. 

Function Controls For Floating Point Multiplier (WTL 
1164) 

For the multiplier, the timing of the function is 
determined by the precision of the B operand. When 
the B operand is single precision, the multiplication is 
performed in the flowthrough operation time, Tfl; 
when the B operand is double precision, the time is 
increased by the accumulate time, Tac. 

The floating point multiplier uses functions provided on 
the floating point ALU for the handling of 
denormalized operands and results, significantly 
increasing multiplier performance and reducing system 
cost. A mode bit is provided to select whether full 
handling of denormalized operands is required. 

In IEEE mode, denormalized operands are detected in 
the multiplier and exceptions are generated when they 
are found, indicating which operands are 
denormalized. The original operands may then be sent 
to the floating point ALU, where the wrap function 


normalizes the operands and extends the range of the 
exponent. The modified operands may then be 
multiplied, using function codes F 2-0 = #2 through #7 in 
the floating point multiplier. In “FAST” mode, 
denormalized operands are treated as zero. 

Whether or not the operands are denormalized, the 
multiplier may also produce a number whose exponent 
is too small to be represented as a normalized floating 
point value. In IEEE mode, one of the exceptions, 
Underflow or Underflow & Inexact, is generated, and 
the value returned is a normalized number with 
extended range and without rounding. This value may 
be converted to a denormalized number or zero value 
by using the Unwrap Exact or Unwrap Inexact functions 
of the floating point ALU. In “FAST” mode, these 
results are given zero values and the Underflow or 
Underflow & Inexact exception is generated. 

Certain restrictions are imposed on the timing with 
which operands are loaded into the WTL 1164 floating 
point multiplier. The A operand is internally encoded 
into a more usable form immediately after the operand 
is loaded. The encoded A operand is loaded into an 
internal register when a multiplication function is 
indicated by the Lo signal. The time required for the 
encoding is Tnc, which is normally no greater than the 
time required to load the B operand. Thus, if the A 
operand is loaded first, followed by the B operand, the 
multiplication function can begin immediately upon 
loading the B operand. If the B operand must be 
loaded first, an additional cycle is required before 
starting the operation. 

In some applications, a series of multiplications may be 
performed with one of the two operands held constant. 
Ideally, in order to maximize performance, this 
constant operand is the A operand, so each 
multiplication can begin immediately upon loading the 
B operand. 
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Method of Operation, continued 


F 

2 F 1 

Fo 

Operation 

Description 

0 

0 

0 

F32 

* F32 

-» F32 / U32 

single multiply 

0 

0 

1 

F32 / F64 

* F32 / F64 

-> F64 / U64 

double multiply 

0 

1 

0 

W32 

* F32 

F32 / U32 

single multiply, A wrapped 

0 

1 

1 

W32 / W64 

* F32 / F64 

-» F64 / U64 

double multiply, A wrapped 

1 

0 

0 

F32 

* W32 

-> F32 / U32 

single multiply, B wrapped 

1 

0 

1 

F32 / F64 

* W32 / W64 

-» F64 / U64 

double multiply, B wrapped 

1 

1 

0 

W32 

' W32 

F32 / U32 

single multiply, A & B wrapped 

1 

1 

1 

W32 / W64 

* W32 / W64 

-> F64 / U64 

double multiply, A & B wrapped 


F 5 F 4 F 3 

Operation 

Description 

0 0 0 

A * 

B 

multiply 

0 0 1 

|A| ‘ 

B 

B times magnitude of A 

0 10 

A * 

B 


A times magnitude of B 

0 1 1 

i A | * 

B 


magnitude of A times B 

1 0 0 

_A * 

B 

multiply and negate 

1 0 1 

- |A| * 

B 

B times negative value of A 

1 1 0 

- A * 

B 


A times negative value of B 

1 1 1 

- |A| * 

B 


negative value of A times B 


Table 3. Function Controls for Floating Point Multiplier 


Function Controls For Floating Point ALU (WTL 1165) 

The function controls for the WTL 1165 are shown in 
Table 4. Mixed precision operations and divide 
operations may generate one of the denormalized input 
exceptions; all other operations can handle 
denormalized operands directly without the exception. 

When a DNRM exception is encountered in a 
mixed-mode operation, the DNRM input must first be 
converted to double precision before the instruction 
can be re-started. 

Like the WTL 1164, the WTL 1165 may produce a 
result whose exponent is too small to be represented as 
a normalized floating point value. When this result 
appears in IEEE mode, one of the exceptions 
Underflow or Underflow & Inexact is generated and the 
value returned is a normalized number with extended 
range and without rounding. This value may be 
converted to a denormalized number or zero value by 


using the Unwrap Exact or Unwrap Inexact functions of 
the floating point ALU. In “FAST” mode, these 
results are given zero values and the Underflow or 
Underflow & Inexact exception is generated. 

When performing the divide function in IEEE mode, 
denormalized operands are detected and exceptions 
are generated when they are found, indicating which 
operands are denormalized. The operands may then 
be loaded into the A operand of the floating point 
ALU, where the wrap function normalizes the operand 
and extends the range of the exponent. The modified 
operands may then be divided using function codes 
Fs-o = #14-15, #22-23, #30-31 in the floating point 
ALU. In “FAST” mode, denormalized operands are 
treated as zero, yielding a zero or infinite result and no 
denormalized operand exceptions. A divide-by-zero 
exception will occur, however, if a denormalized 
operand is used as a divisor in “FAST” mode. 
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Method of Operation, continued 


F s 

F 4 

F 3 

F 2 

F, 

F 0 


Operation 

Description 

0 

0 

0 

0 

0 

0 

(0) 

F32 

— 

F32 

-> 

F32/U32 

single subtract 

0 

0 

0 

0 

0 

1 

(1) 

F32/F64 

- 

F32/F64 


F64/U64 

double subtract 

0 

0 

0 

0 

1 

0 

(2) 


F32 

- 

F32I 

-> 

F32/U32 

single magnitude of difference 

0 

0 

0 

0 

1 

1 

(3) 


F32/F64 

- 

F32/F64I 


F64/U64 

double magnitude of difference 

0 

0 

0 

1 

0 

0 

(4) 


F32| 

- 

F32| 

-> 

F32/U32 

single subtract magnitudes 

0 

0 

0 

1 

0 

1 

(5) 


F32/F64I 

- 

F32/F64| 

-> 

F64/U64 

double subtract magnitudes 

0 

0 

0 

1 

1 

0 

(6) 

F32 

-f 

F32 

-> 

F32/U32 

single divide 

0 

0 

0 

1 

1 

1 

(7) 

F32/F64 

-4- 

F32/F64 

-> 

F64/U64 

double divide 

0 

0 

1 

0 

0 

0 

(8) 

-F32 

+ 

0 

-> 

F32/U32 

single negate plus zero 

0 

0 

1 

0 

0 

1 

0) 

-F32/F64 

+ 

0 

-» 

F64/U64 

double negate plus zero 

0 

0 

1 

0 

1 

0 

(10) 







0 

0 

1 

0 

1 

1 

(11) 







0 

0 

1 

1 

0 

0 

(12) 







0 

0 

1 

1 

0 

1 

(13) 







0 

0 

1 

1 

1 

0 

(14) 

W32 

4~ 

F32 


F32/U32 

single divide, A wrapped 

0 

0 

1 

1 

1 

1 

(15) 

W32/W64 

4~ 

F32/F64 

-A 

F64/U64 

double divide, A wrapped 

0 

1 

0 

0 

0 

0 

(16) 

F32 

+ 

F32 

-> 

F32/U32 

single add 

0 

1 

0 

0 

0 

1 

(17) 

F32/F64 

+ 

F32/F64 

-> 

F64/U64 

double add 

0 

1 

0 

0 

1 

0 

(18) 


F32 

+ 

F32I 

-> 

F32/U32 

single magnitude of sum 

0 

1 

0 

0 

1 

1 

(19) 


F32/F64 

+ 

F32/F64I 

-> 

F64/U64 

double magnitude of sum 

0 

1 

0 

1 

0 

0 

(20) 


F32I 

+ 

F32| 


F32/U32 

single add magnitude 

0 

1 

0 

1 

0 

1 

(21) 


F32/F641 

+ 

F32/F64| 

— > 

F64/U64 

double add magnitude 

0 

1 

0 

1 

1 

0 

(22) 

F32 

4- 

W32 

-> 

F32/U32 

single divide, B wrapped 

0 

1 

0 

1 

1 

1 

(23) 

F32/F64 

4- 

W32/W64 

-> 

F64/U64 

double divide, B wrapped 

0 

1 

1 

0 

0 

0 

(24) 

F32 

+ 

0 

-» 

F32/U32 

single plus zero 

0 

1 

1 

0 

0 

1 

(25) 

F32/F64 

+ 

0 

-> 

F64/U64 

double plus zero 

0 

1 

1 

0 

1 

0 

(26) 







0 

1 

1 

0 

1 

1 

(27) 







0 

1 

1 

1 

0 

0 

(28) 


F321 

+ 

0 

-> 

F32/U32 

single absolute value 

0 

1 

1 

1 

0 

1 

(29) 


F32/F64I 

+ 

0 

-> 

F64/U64 

double absolute value 

0 

1 

1 

1 

1 

0 

(30) 

W32 

4- 

W32 

—> 

F32/U32 

single divide, A & B wrapped 

0 

1 

1 

1 

1 

1 

31) 

W32/W64 

4- 

W32/W64 -> 

F64/U64 

double divide, A & B wrapped 

1 

0 

0 

0 

0 

0 

(32) 

Compare F32 

- F32 



single compare 

1 

0 

0 

0 

0 

1 

(33) 

Compare F32 / F64 - F32 / F64 

double compare 

1 

0 

0 

0 

1 

0 

(34) 







1 

0 

0 

0 

1 

1 

(35) 







1 

0 

0 

1 

0 

0 

(36) 

Compare 

F32| - |F32| 



single compare magnitude 

1 

0 

0 

1 

0 

1 

(37) 

Compare 

F32 / F64I - 

IF32 / F64I 

double compare magnitude 

1 

0 

0 

1 

1 

0 

(38) 







1 

0 

0 

1 

1 

1 

(39) 







1 

0 

1 

0 

0 

0 

(40) 

Compare F32 

- 0 



single compare with zero 

1 

0 

1 

0 

0 

1 

(41) 

Compare F32 / F64 - 0 



double compare with zero 

1 

0 

1 

0 

1 

0 

(42) 







1 

0 

1 

0 

1 

1 

(43) 







1 

0 

1 

1 

0 

0 

(44) 







1 

0 

1 

1 

0 

1 

(45) 







1 

0 

1 

1 

1 

0 

(46) 







1 

0 

1 

1 

1 

1 

(47) 







1 

1 

0 

0 

0 

0 

(48) 

U32 -> F32 (Exact) 



single unwrap exact value 

1 

1 

0 

0 

0 

1 

(49) 

U64 —> F64 (Exact) 



double unwrap exact value 

1 

1 

0 

0 

1 

0 

(50) 

F39 —^ W99 



single wrap denormalized value 

1 

1 

0 

0 

1 

1 

(51) 





double wrap denormalized value 

1 

1 

0 

1 

0 

0 

(52) 

1)32 —> F32 (Inexact) 



single unwrap inexact value 

1 

1 

1 

1 

0 

0 

1 

1 

0 

1 

1 

0 

(53) 

(54) 

U64->F64 (Inexact) 



double unwrap inexact vlaue 

1 

1 

0 

1 

1 

1 

(55) 








Table 4. Function Controls for Floating Point ALU 
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Method of Operation, continued 


Fg 

F 4 

F 3 

F 2 

F, 

F 0 


Operation 

Description 

1 

1 

1 

0 

0 

0 

(56) 

132 -> F32 

single convert to integer 

1 

1 

1 

0 

0 

1 

(57) 

132 —> F64 

double convert to integer 

1 

1 

1 

0 

1 

0 

(58) 

132 -> F32 

single float 

1 

1 

1 

0 

1 

1 

(59) 

132—> F64 

double float 

1 

1 

1 

1 

0 

0 

(60) 

F64 -> F32 / U32 

convert double to single 

1 

1 

1 

1 

0 

1 

(61) 

F32H> F64 

convert single to double 

1 

1 

1 

1 

1 

0 

(62) 



1 

1 

1 

1 

1 

1 

(63) 




Table 4. Function Controls for Floating Point ALU, continued 


Mode Controls 

The mode controls are not directly loaded from mode control segments to be loaded with the signal 
external pins. Instead, when the load mode command F3-0 , according to the following table, 
is presented on the L signal, F5-4 selects one of four 


f 5 f 4 

EDGE #0 

EDGE #0.5 

0 0 (0) 

F -> FI 

FI 3_o -> MODE 3 _o 

0 1 (1) 

F FI 

FI 3-0 MODE 7 _4 

1 0 ( 2 ) 

F "> FI 

FI 0 n MODE 

1 1 (3) 

F FI 

fi« -» mode ]£ 2 


Mode controls must not be altered while a floating 
point function is in progress. 


“FAST”/IEEE FORMAT MODE CONTROL 

MODEo controls the floating point format used for all operands and results are handled, according to the 
operations and the manner in which the denormalized following table. 


MODEo 

Function 

0 

Multiplier and ALU generate denormalized 

(IEEE mode) 


operand exceptions and produce UNRM 
values on underflow exceptions. 

1 

Multiplier and ALU flush denormalized operands 
to zero and round underflow results to zero. 

(FAST mode) 


ROUNDING MODE CONTROL FOR ALU FIX FUNCTION 

MODEi controls the IEEE rounding mode for the ALU Double to Integer, according to the following table, 
functions Convert Single to Integer and Convert 


MODEi 

Function 

0 

1 

Round according to default rounding mode (MODE 3-2) 

Round toward zero, regardless of the default rounding mode 
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Mode Controls, continued 


ROUNDING MODE CONTROL 

MODE3-2 control the IEEE rounding mode for all Single to Integer and Convert Double to Integer, 

operations, except for the ALU functions Convert according to the following table. 


MODE 3-2 

Function 

00 

Round toward nearest value or even significand if a tie 

01 

Round toward zero 

10 

Round toward positive infinity 

11 

Round toward negative infinity 


FLOWTHROUGH TIMER CONTROL 

MODE7-5 allow the number of clock cycles required to 
meet the flowthrough time Tfl to be selected using 
clock periods other than 60 ns. The value for Tfl 
which should be used is specified on page 5 under “AC 
ELECTRICAL CHARACTERISTICS”. Using 240 ns 
for Tfl and the recommended clock speed of 60 ns, 
the value, k, which should be loaded into MODE7-5 is 


three. To determine k for other clock periods or 
flowthrough times, the following formula may be used 
where I means round up to the next integer greater 
than or equal to the result: 

MODE 7-5 = T (Tfl -f- Clock Period) - 1 


Clock Frequency (MHz) 

Clock Period (ns) 

r T fl -r- Clock Period 

MODE ?■ 

-5 

20.0 

50 

5 

100 

(4) 

16.7 

60 

4 

011 

(3) 

14.3 

70 

4 

011 

(3) 

12.5 

80 

3 

010 

(2) 

8.33 

120 

2 

001 

(1) 


The flowthrough timer should never be set to 0. 

ACCUMULATE TIMER CONTROL 

If the B operand on the WTL 1164 is single precision, 
the multiplication is performed as a flowthrough 
operation, but if the B operand is double precision two 
passes must be made through the multiplier array. 
After the first pass the partial results of the 
multiplication are held in an accumulate register. 
These partial results are added to the results of the 
second pass to form the unrounded product. The 
minimum time required for the first pass through the 
array is called the accumulation time. For a given clock 
frequency the time taken for this accumulation 
operation is controlled by MODE 11 - 10 . MODE 11-10 
control when the Internal Accumulator Register in the 
multiplier is clocked. Loading a value, k, into 


MODE 11-10 causes the accumulator to be clocked k + 1 
cycles after an operation is started. 

Most functions of the WTL 1165 are performed as a 
flowthrough operation, except for divide operations. 
Divide operations on the WTL 1165 are performed in 
a bit-sequential fashion and require 25 iterations for a 
single precision divide and 54 iterations for a double 
precision divide. The time taken for each iteration is 
controlled by MODE11-10. MODEi 1-10 control the rate at 
which the internal accumulator register in the divide 
circuitry is clocked. Loading a value, k, into MODEi 1-10 
causes the accumulator to be clocked every k + 1 
cycles after the start of a divide operation. 
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Mode Controls, continued 


In general, the value used for MODE 11-10 will not be the 
same for both chips. The value for Tac which should be 
used is specified on page 5 under “AC ELECTRICAL 
CHARACTERISTICS”. The formula for calculating 
the value to be loaded into MODEn-iois given below: 

MODE 11-10 = I(Tac- 7 - Clock Period) - 1 

Where f means round up to the next integer greater 
than or equal to the result. 


The value which should be loaded into MODE11-10 for 
the WTL 1164 using 120 ns for Tac and a 60 ns clock 
is one. Values for other clock frequencies are shown in 
the table below: 


Clock Frequency (MHz) 

Clock Period (ns) 


MODE ii- 

-10 

20.0 

50 

3 

10 

(2) 

16.7 

60 

2 

01 

(1) 

14.3 

70 

2 

01 

(1) 

11.1 

90 

2 

01 

(1) 

8.33 

120 

1 

00 

(0) 


The value which should be loaded into MODEn-iofor zero. Values for other clock frequencies are shown in 
the WTL 1165 using 60 ns for Tac and a 60 ns clock is the table below: 


Clock Frequency (MHz) 

Clock Period (ns) 


MODE 

11-10 

20.0 

50 

2 

01 

(i) 

16.7 

60 

1 

00 

(0) 

14.3 

70 

1 

00 

(0) 

11.1 

90 

1 

00 

(0) 

8.33 

120 

1 

00 

(0) 


After the final firing of the accumulate timer on either timer expires. Operation timing can thus be determined 
the WTL 1164 or WTL 1165, the flowthrough timer is from the following formulae, 
started, terminating the operation after the flowthrough 


Operation 

Time required in cycles 

Multiply, B operand is single precision 

Tfl 

Multiply, B operand is double precision 

Tac + T FL 

Divide, result is single precision 

25 * T AC + T FL 

Divide, result is double precision 

54 * T ac +T fl 

Other WTL1165 operations 

Tfl 


RESERVED MODE CONTROL BITS 

MODE15-12, MODE9-8 and MODE4 on the floating point 
ALU chip and the floating point multiply chip are 
currently reserved for future definition. These mode 


control bits should be set to zero to assure compatibility 
with future derivatives of the WTL 1164/1165. 
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Result Status 

The S Bus indicates any exceptions or conditions that 
result from operations performed by the floating point 
units. The status bits associated with an operation have 
the same timing as the results. The S Bus is controlled 
by the CSUS- control, which causes the S Bus to be 
driven two clock cycles after CSUS is asserted. For 


comparison operations, the S Bus indicates the 
condition resulting from the comparison; for all other 
operations, the S Bus indicates exception status arising 
from the associated operation. For all operations, the S 
Bus has the following meanings. 


s 

3 s 

2 S 

i So 


Comparison Condition 

Exception Status 

0 

0 

0 

0 

(0) 

Equal 

Result = +0 or -0, exact 

0 

0 

0 

1 

(1) 

Less than 

Result = -^infinity or -infinity, exact 

0 

0 

1 

0 

(2) 

Greater than 

Result finite and ^ 0, exact 

0 

0 

1 

1 

(3) 


Result finite and ^ 0, inexact 

0 

1 

0 

0 

(4) 


- not used 

0 

1 

0 

1 

(5) 


Overflow & inexact 

0 

1 

1 

0 

(6) 


Underflow 

0 

1 

1 

1 

(7) 


Underflow & inexact 

1 

0 

0 

0 

(8) 


Operand A is Denormalized 

1 

0 

0 

1 

(9) 


Operand B is Denormalized 

1 

0 

1 

0 

(10) 


Operands A & B are Denormalized 

1 

0 

1 

1 

(11) 


Divide by Zero 

1 

1 

0 

0 

(12) 


Operand A is NaN 

1 

1 

0 

1 

(13) 


Operand B is NaN 

1 

1 

1 

0 

(14) 


Operands A & B are NaN 

1 

1 

1 

1 

(15) 

Unordered 

Invalid Operation 


Exception status signals #0-#3 indicate normal 
completion of the operation. The specific value of the 
status signal indicates the type of result produced, 
either zero, non-zero or infinity as well as exact or 
inexact. Signals #0-#2 indicate an exact result, as 
defined by the IEEE standard. Signals #3, #5, #6 and 
#7 indicate an inexact result. While #6 does not imply 
inexact directly, the result after unwrapping may be 
exact or inexact and the status bits correctly indicate 
exactness at the conclusion of the unwrap operation. 

Signals #6 and #7 have special meaning in IEEE mode. 
These signals indicate the result is smaller than the 
minimum magnitude normalized number representable 
in the specified format and a UNRM value is presented 
as a result. This value should be converted to a legal 


denormalized number by the Unwrap Exact (for signal 
#6) or Unwrap Inexact (for signal #7) functions of the 
floating point ALU. The exception status indicated by 
the ALU at the conclusion of the unwrap operation, 
either #6 or #7, properly indicates the exactness of the 
result. 

Status numbers eight through 11 signify deneormalized 
input values (DIN) in IEEE mode. Status numbers 12 
through 15 signify invalid operations (INV) and source 
NaNs. 

Under certain conditions, multiple exceptions can 
occur. These exceptions may be resolved with the 
following priority table, where higher-priority 
exceptions will mask lower-priority exceptions. 
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Result Status, continued 



Priority 

Exception 

Highest 

Operands A & B are NaN 

Operand A is NaN 

Operand B is NaN 

Invalid Operation 

Divide by Zero 

Operands A & B are denormalized 
Operand A is denormalized 

Operand B is denormalized 

Underflow & inexact 

Underflow 

Overflow & Inexact 

Lowest 

Result is finite and <> 0, inexact 


Timing 

The WTL 1164 and WTL 1165 have very flexible 
loading controls to optimize loading order for total 
system performance. Figures 3 through 11 show 
loading sequences which minimize flowthrough times 
and are the preferred loading sequences. Figures 15 
and 16 show loading sequences which are also valid but 
incur somewhat longer flowthrough times. While all of 
the loading sequences shown are guaranteed, 
restricting the loading order to the preferred loading 
sequences will minimize timing changes when using 


higher clock rates on future WTL 1164/1165 
compatable devices. 

PRECISION PREFERRED LOADING SEQUENCES 

To conform to the preferred loading sequence, the 
loading order must be followed except where indicated. 
Other sequences may still be acceptable (see alternate 
loading sequences) but minor timing changes may be 
required for use with future WTL 1164/1165 family 
devices at higher clock speeds. 



Figure 7. Single Precision Multiply 
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Timing, continued 
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Unloading CM or CL may be done in either order 
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Figure 10. Double Precision Multiply 


CLK 



Last load may be AL or BL but should not be AM or BM 


Figure 11. Double Precision Divide 
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Timing, continued 



Figure 12. Double Precision ALU Operation 


MIXED-MODE LOADING SEQUENCES 

When operating with mixed single and double precision precision values before the operation can be 

operands, the loading order is especially important. completed. For that reason, mixed-mode operations 

Special values such as zero, infinity and denormalized must be loaded as specified below, 
numbers require conversion from single to double 



Figure 13. Mixed-mode Minimum Latency Multiply 
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Timing, continued 


CLK 

X 

L, CSL- 
F 

U, CSU- 
S 



Single precision operand may be A or B but must be loaded first. 


Figure 14. Mixed-mode ALU Operation 


ALTERNATE LOADING SEQUENCES 


Alternate loading sequences are those which cause 
flowthrough or encode times to be greater than the 
preferred loading sequences. Except for mixed-mode, 
operands may be loaded in any order. Alternate 


loading sequences are guaranteed by the Tfl and Tnc 
specifications but may not be optimum, mixed-mode 
operands must be loaded in the order specified by 
figures 13 and 14. 



Figure 15. Single Precision Multiply, B Operand First 



t 
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Timing, continued 



Figure 16. Double Precision ALU Operation, MSW Loaded Last 


OPTIMIZING FLOATING POINT PERFORMANCE 

On the WTL 1164, if the least significant half of the B accumulate time, Tac. This is possible because the 
operand is loaded first, the floating point multiplication initial accumulation operation requires only the least 
can be started before loading the most significant half significant word of the B operand. The preferred 

of the B operand. (See figures 17, 18 and 19.) The loading order for the A operand is LSW followed by 

most significant half must be loaded on the following MSW. 
cycle and the cycle time must be less than the 



Figure 17. Double Precision Minimum Latency Multiply, Preferred Loading 
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Timing, continued 


CLK 

X 

L, CSL- 
F 

U, CSU- 
S 



Figure 18. Double Precision Minimum Latency Multiply, Alternate Loading 



Figure 19. Alternate Mixed Mode Multiply 


The WTL 1164/1165 contain a result latch at the end 
of the floating point hardware, allowing results to be 
loaded directly into one or both of the operand 
registers (a chained operation) or to start another 
operation before unloading the result (a pipelined 
operation). 

Results may be chained from a result register back to 
an operand register by selecting both a load and unload 
operation on the same clock cycle. The result is passed 
onto the internal bus and then clocked into the 
indicated operand register. Normally, the result is also 
presented on the X Bus on the next clock cycle, which 
can be useful for ensuring that sufficient internal state 
is retained externally for aborting and later resuming 


an operation of this kind. If avoiding driving the X Bus 
is desirable, it can be tri-stated by bringing the OE- pin 
to an inactive (high) level. The time for a series of 
operations where the result is chained back to one of 
the operand registers, and another operand is loaded 
from the X Bus may be calculated as: 

Trc = T"cy + T op 

For single precision operations (see Figure 20), and for 
double precision operations (see Figure 21), the time 
for each chained operation is: 

T rc = 3 T cy + T OP 
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Timing, continued 
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Timing, continued 

The WTL 1164/1165 may be used in a pipelined 
fashion by loading new operands and starting a second 
operation before unloading the result of the previous 
operation. This previous result must be unloaded by 
selecting unload operations immediately after loading 
the final operand for the second operation. Due to the 
one cycle delay between the external X Bus and the 
internal XI Bus in each direction, two cycles will elapse 
before the result appears on the bus. In order to assure 
that sufficient time to unload the result before the 
completion of the next operation reloads the result 
register, Top must be at least 3*TcYfor single precision 
results and at least 4*Tcyfor double precision results. 
The effective pipeline rate can be calculated from the 
formula: 


T RC= ma *( T CY +T OP’ 5 ‘ T CV> 

For single precision operations (see Figure 22), and for 
double precision operations (see Figure 23), the 
effective pipeline rate is: 

Tr C = max ( 3 Tcy + T op , 8 Tcy ) 

If a double precision multiply is performed when 
loading the least-significant B operand first, the 
effective pipeline rate is: 

Tr C = max ( 2 Tcy + T” op > 8 Tcy ) 


Note that Top must be at least 4*Tcy for this operation. 



Figure 22. Single Precision Pipelined Multiply 
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Timing, continued 



Figure 23. Double Precision Pipelined Multiply 


USING FASTER AND SLOWER CLOCK RATES 

Because the WTL 1164 and WTL 1165 use both 
clocked and unclocked logic in the arithmetic data 
path units, certain optimizations can be made when the 
parts are operated with clock rates that differ from the 


standard maximum clock rate. This permits greater 
performance in applications where the standard clock 
rate cannot be easily maintained. 
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Application Note 


WEITEK FLOATING POINT COPROCESSOR INTERFACE 


The simple interface and high speed of the WTL 
1164/1165 make it an ideal chip set for accelerating 
the floating point performance of popular 
microprocessors. All of the control required to 
implement the interface between the microprocessor 
and the WTL 1164/1165 can be put onto one chip. 


The WTL 1163 furnishes the interface between the 
WTL 1164/1165 and Intel’s 80386. The three chip set 
can then provide 3.5 mega Whetstone performance. 
The chip set is fully supported with Fortran, Pascal and 
C compilers. A block diagram of the three chip set 
showing the system interface is shown below. 



Figure 24. Coprocessor Block Diagram 


For more information on coprocessor interface chips, 
call WEITEK. 
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IEEE Compatibility 


The IEEE Standard for Binary Floating Point specifies 
floating point processor data formats, rounding modes 
and exception handling. The WTL 1164 and WTL 
1165 conform to the specification. The discussion 


below reviews IEEE implementation on the WTL 
1164/1165. A separate note describes how 
denormalized numbers (DNRMs) are handled. 


DATA FORMATS 


The WTL 1164/1165 perform both 32-bit and 64-bit format has a 24-bit sign-magnitude fraction field and 

IEEE standard floating point operations. The 32-bit an 8-bit exponent, in the following format: 


IEEE STANDARD SINGLE PRECISION 

31 30 23 22 0 



1 8 


23 


e 

f 

value 

name 

255 

not 0 

none 


NaN (Not A Number) 

255 

0 

s 

h) 

* infinity 

Infinity 



s 

e -127 


1 .. 254 

any 

(-1) 

* 2 * (l.f) 

Normalized number 



s 

-126 


0 

not 0 

(-1) 

* 2 * (O.f) 

Denormalized number 

0 

0 

s 

(-1) 

* 0 

Zero 


Exponent values for normalized single precision 
numbers range from one to 254, with exponents of 
zero and 255 being reserved for special operands. To 
calculate the value of a number in this format, the 
exponent is decremented by 127 (the “exponent bias” 
is +127), and the fraction has a one inserted before the 


binary point. (This is called the hidden bit.) The value 
of the number is then (-1) s X 2 0-127 X (l.f) . 

The 64-bit format has a 53-bit signed-magnitude 
fraction field and an 11-bit exponent, in the following 
format: 
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IEEE Compatibility, continued 


IEEE STANDARD DOUBLE PRECISION 

63 62 52 51 0 



111 52 


e 

f 

value 

name 

2047 

2047 

not 0 

0 

none 

(-1) * infinity 

NaN (Not A Number) 

Infinity 

1 .. 2046 

any 

s e-1023 

(-1) * 2 * (l.f) 

Normalized number 

0 

not 0 

S -1022 

(-1) * 2 * (O.f) 

Denormalized number 

0 

0 

M) S * o 

Zero 


Exponent values for normalized double precision 
numbers range from one to 2,046 with exponents of 
zero and 2,047 being reserved for special operands. To 
calculate the value of a number in this format, the 


exponent is decremented by 1,023 (the exponent bias 
is +1,023), and the fraction has a one inserted before 
the binary point. Thus the value of a double precision 
number is (-1) s X 2 e-1023 X (l.f). 



INTEGER FORMAT 

The WTL 1164/1165 family supports conversion 
operations to and from 32-bit two’s complement 
integers. The format is given below. 


INTEGER FORMAT 


31 30 


1 31 


0 


s 

value 

name 

0 

CO 

CM 

1 

positive integer 

i 

negative integer 
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PRELIMINARY DATA 


IEEE Compatibility, continued 

Several number types are required to implement the 
standard. These are normalized numbers, 
denormalized numbers, wrapped numbers, infinity and 
zero. 

Normalized Numbers (NRM) 

Most caculations are performed on normalized 
numbers. For single precision, normalized numbers 
have an exponent that ranges from 00000001 to 
11111110 (one to 254) and a normalized fraction field 
(the leftmost or hidden bit is a one). In decimal 
notation, this allows one to represent a range of both 
positive and negative numbers from roughly 10 38 to 
10” 38 with accuracy to seven decimal places. Double 
precision numbers have an exponent ranging from one 
to 2,046 and a normalized fraction field. 

Infinity (INF) 

Infinity has an exponent of all ones and a fraction field 
equal to zero. Both positive and negative infinity are 
allowed. 

ZERO 

ZERO has an exponent of zero, a hidden bit equal to 
zero and a value of zero in the fraction field. Both +0 
and -0 are supported. 

Wrapped Numbers (WNRM) 

A wrapped number is created by normalizing a DNRM’s 
fraction field and subtracting from the exponent the 


number of shift positions required. (Normalizing is 
accomplished by left shifting until the hidden bit 
contains a one.) The value of the exponent is equal to 
(1 - (the number of shifts)) and is represented in two’s 
complement. 

Unrounded Normalized Number (UNRM) 

A UNRM is the result of an operation that has a 
magnitude less than the minimum representable 
normalized number. A UNRM has a normalized 
fraction field, a wrapped exponent and a hidden bit 
equal to one. The minimum UNRM is attained by 
multiplying two DNRM.MINs. UNRMs are turned into 
DNRM values using the ALU’s unwrap function. 

Denormalized Numbers (DNRM) 

Denormalized numbers have a zero exponent and a 
denormalized (hidden bit equal to zero) non-zero 
fraction field. 

Not A Number (NaN) 

NaN, or not a number, is a special data format usually 
used as a flag for data flow control, for uninitialized 
variables or to signify an invalid operations such as 0 x 
oo. The format for a NaN is an exponent of all ones 
and a non-zero fraction. 


IEEE SINGLE PRECISION FORMATS SUPPORTED BY THE WTL 1164/1165 


OPERAND 

EXPONENT 

FRACTION 

HIDDEN BIT 

VALUE 

NaN 

255 

ANY 

N/A 


NONE 

INFINITY 

255 

ALL 0’s 

1 

(-1) S 
(-1)® 
H)® 
H) 
H) ! 
H>® 
H )® 
H )® 
(-U® 

OO 

NORM.MAX 
NORM 

NORM.MIN 
DNRM. MAX 
DNRM 

DNRM.MIN 
WNRM. MAX 

254 

1 to 254 

1 

0 

0 

0 

0 

ALL 1 's 
ANY 

ALL 0’s 

ALL 1 's 
ANY 

000...01 

ALL 1 's 

1 

1 

1 

0 

0 

0 

1 

X 2 1-127 X (2 > 

X2 -126 

X 2 -126 X 1 U 9 

X <?126 - 2 > 

X 2 X (O.f) 

X2" X2' 23 

-126 
x 2 ‘ 

WNRM 

0 to (-22) 

ANY 

1 

x2 !; 49 x o- f ) 

X 2 

WNRM.MIN 

-22 

ALL 0’s 

1 

(-D® 

H)® 

(-1) 

UNRM. MAX 

0 

ALL I’s 

1 

-126 

X 2 

UNRM.MIN 

-171 

ALL 0’s 

1 

-298 

X 2 

ZERO 

0 

ALL 0's 

0 

(-1) s 

0 
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IEEE Compatibility, continued 


The same formats are supported in double precision. mantissa and exponent in the pattern shown above. A 

The range of double precision numbers and their partial table of values is given below, 

values are obtained by substituting the double precision 


E 

F 

VALUE 

NAME 

MNEMONIC 

2047 

Not all zeros 

None 

Not a number 

NaN 

2047 

All zeros 

(-1) s * Infinity 

Infinity 

INF 

1-2046 

Any 


Normalized number 

NOR 

0 

Not all zeros 

(-1) s * 2' 1022 * (0.F) 

Denormalized number 

DNRM 

0 

Zero 

1 

U) 

» 

o 

Zero 

ZERO 


ROUNDING OPTIONS 

The WTL 1164/1165 support all four rounding modes 
of the IEEE standard — round to nearest, round 
toward zero, round toward plus infinity and round 
toward minus infinity. Rounding may be biased or 
unbiased. Biased rounding introduces a small offset in 
the direction of the bias. Positive bias, negative bias or 
a bias toward zero are specified in the IEEE format. 
Unbiased rounding rounds the result to the nearest 
representable number. In the case of a number exactly 
halfway between two representable numbers, the 
number is rounded toward the closest even number, 
resulting in half the numbers rounding up and half 
rounding down, on average. 


Round Toward Zero 

Rounds the result to the value closest to but not greater 
than the magnitude of the result. 

Round Toward Plus Infinity 

Rounds the result to the value closest to but not less 
than the result. 

Round Toward Minus Infinity 

Rounds the result to the value closest to but not greater 
than the result. 


Round To Nearest 

Rounds the result to the nearest representable value. If 
two numbers are equally near the result, the even 
number is chosen. 


EXCEPTION HANDLING 

The WTL 1164/1165 generate the exceptions specified 
in the IEEE standard for floating point operations. The 
status word corresponding to an operation is 
propagated through the array and pipeline registers in 
synchrony with the operands and partial results. The 
status outputs are registered when the output data is 
clocked into the output register and is valid until the 
next rising edge of the clock. The status outputs also 
indicate the result of a Compare operation. Thus 
Compare precludes the indication of other exceptions. 


Inexact (NXT) 

NXT is generated on the WTL 1164/1165 whenever 
there is a loss of accuracy. The chips compute results 
to higher precision than the number of mantissa bits 
that appear in the result. If any of the fraction bits less 
than the LSB was equal to one prior to rounding, then 
the inexact bit will be high. NXT will be signaled in the 
WTL 1165 if there is a partial or complete loss of 
significance in a float-to-fixed operation. 
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IEEE Compatibility, continued 
Overflow (OVF) 

OVF is generated when the result of a floating point 
operation overflows the largest representable 
normalized number. The result produced at the output 
is either infinity or the largest representable positive or 
negative number, depending upon the rounding mode 
as follows: 

+MAX.NRM if ((RM or RZ) and the result 

is positive) 

-MAX.NRM if ((RP or RZ) and the result 

is negative) 

+oo if ((RN or RP) and the result is positive) 

-oo if ((RN or RM) and the result is negative) 

Overflow is also generated when converting 

floating-point-to-fixed-point and the result overflows 
the 32-bit format. 

Underflow (UNF) 

When the result of an operation after rounding is less 
than the minimum normalized number, UNF is 

asserted. A result of exactly zero does not underflow. 

Divide By Zero (DVZ) 

The WTL 1165 will assert a DVZ exception when 
performing division on a normalized dividend and a 
zero divisor. The result is a properly signed infinity. 

Invalid Operation (INV) 

INV (status codes 12 through 15) will be asserted if a 
NaN is an input or if an invalid operation occurs. The 
invalid WTL 1164 operation is oo • 0. Invalid WTL 
1165 operations include oo oo, 0 0, subtraction of 

like infinities (oo - oo) and addition of opposite 
infinities (+oo - oo). When either input is NaN the 
status code indicates the source of the NaN operand(s). 
The result of any invalid operation (12 through 15) is 
NaN with the fraction and exponent all ones. The sign 
bit is zero. 

Denormalized Input (DIN) 

DIN is asserted whenever an operand is denormalized 
and the chip is in IEEE mode. 


of zero and a fraction field of all zeros. This gives users 
the ability to deal with numbers between NORM.MIN 
and ZERO. These numbers are known as denormals. 
Their format is given in the number format section. 
The IEEE standard has specified gradual underflow as 
the way to handle denormals. Many of the features of 
the WTL 1164/1165 are included to deal with 
denormals in a manner consistent with IEEE Standard 
754, Version 10.0. Since denormals are very close to 
zero, many applications can substitute zero for a 
denormal without a signifcant loss of accuracy. For 
these applications, a “FAST” mode is included which 
substitutes zero for all denormalized inputs to the WTL 
1164 and 1165. ZERO is also inserted for all UNRM 
outputs in “FAST” mode. 

For all arithmetic operations other than divide, the 
WTL 1165 handles denormalized inputs directly as it 
would handle any other number. 

Unfortunately, a floating point multiplier must either 
operate exclusively on normalized numbers or suffer 
large cost and performance penalties in dealing directly 
with denormals. A normalized format that yields an 
equivalent to a given denormalized number is the 
wrapped format. The number format table shows the 
equivalence of wrapped and denormalized numbers. 
To translate a denormalized number to a wrapped 
number, the fraction is normalized (shifted up so that a 
one is in the hidden bit) and one is subtracted from the 
exponent for every position shifted. The WTL 1164 
can correctly multiply either two wrapped numbers or a 
wrapped and a normalized number. To understand the 
full procedure, consider the following case. 

Assume one of the two input operands to the WTL 

1164 is a denormalized number. Four cycles after the 
input, the denorm exception is flagged. The 
denormalized operand must then be sent to the WTL 

1165 to be wrapped. Once wrapped, the operand can 
be sent back to the WTL 1164 for multiplication. The 
result of the multiplication will either be a normalized 
number or a UNRM. 

If the result is a UNRM, status bit So indicates either 
UNF (if all the truncated bits are equal to zero) or 
UNF-NXT (if any of the truncated bits is equal to one). 

No rounding will occur regardless of the rounding 
mode specified. 


GRADUAL UNDERFLOW 

The minimum normalized number has an exponent of 
one and a fraction field of zero. Zero has an exponent 
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IEEE Compatibility, continued 


The underflowed number may then be sent to the 
WTL 1165 for “unwrapping”. To unwrap a number, 
the fraction field is shifted right and the exponent 
incremented by one for each shift position. Status bit 


So must be used to conditionally execute the Unwrap 
Inexact or Unwrap Exact instruction. The rounding 
must be performed in the ALU. The unwrapping may 
have three possible results: 


RESULT 

EXCEPTION 

COMMENT 

DNRM 

UNF 

When the denormalized result is exact. Note that his 



result is possible only if the UNRAP EXACT instruction is 
is possible (i. e., both the input and the result must 
be exact.) 

DNRM 

UNF-NXT 

If the UNWRAP INEXACT instruction is executed or 
if the result of the UNWRAP EXACT instruction is inexact. 

ZERO 

UNF-NXT 

The result is zero, but the unwrapping has resulted 
in the loss of precision. 


Operations 

The following tables delineate the results that are 
obtained for all combinations of input data formats and 
rounding options, for both the WTL 1164 and the 


WTL 1165 in IEEE as well as “FAST” mode. The 
format used in the tables is STATUS: (status 
code)-Result. 


TABLE 5: FLOATING POINT ADD/SUBTRACT ("FAST” MODE) 

A/B 

ZERO 

DNRM 

NRM 

INF 

NaN 

NaN 

INV: 12-Nan 

INV:12-NaN 

INV:12-NaN 

INV:12-NaN 

INV:14-NaN 

INF 

OK*:1-INF 

OK*:1-INF 

OK*:1-INF 

INV:15-NaN (2) 
OK*:1-INF (1) 

INV:13-NaN 

NRM 

OK-.2-NRM 

OK:2,3-NRM 

OVF: 5-(4) 
OK:2,3-NRM 
UNF:6,7-ZERO 
OK:0-ZERO 

OK*:1-INF 

INV:13-NaN 

DNRM 

OK:0-ZERO (3) 

1 

OK:0-ZERO 

OK:2,3-NRM 

OK*:1-INF 

INV:13-NaN 

ZERO 

OK:0-ZERO (3) 

OK:0-ZERO (3) 

OK:2-NRM 

OK*:1-INF 

INV:13-NaN 


*lf an operand is INF, OK will be signaled rather than OVF (see Note 1) 
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PRELIMINARY DATA 


Operations, continued 


Notes: 


1. +INF+INF -*• +INF 

-INF-INF -*• -INF 


2. +INF-INF 
-INF+INF 


NaN (invalid operation) 
NaN (invalid operation) 


3. +ZERO+ZERO 
-ZERO-ZERO 
+ZERO-ZERO 
+ZERO-ZERO 
-ZERO+ZERO 
-ZERO+ZERO 


+ZERO (RN, RZ, RP, RM) 
-ZERO (RN, RZ, RP, RM) 
+ZERO (RN, RZ, RP) 
-ZERO (RM) 

+ZERO (RN, RZ, RP) 
-ZERO (RM) 


OVF will produce INF or MAX.NRM, depending upon the rounding mode: 
+MAX.NRM IF [(RM, RZ) AND (RESULT IS +)] 

-MAX.NRM IF [(RP, RZ) AND (RESULT IS -)] 

+INF IF [(RN, RP) AND (RESULT IS +)] 

-INF IF [(RN, RM) AND (RESULT IS -)] 


TABLE 6: FLOATING POINT MULTIPLICATION (“FAST” MODE) 

A/B 

ZERO 

DNRM 

NRM 

INF 

NaN 

NaN 

INF:12-NaN 

INV: 12-NaN 

INV: 12-NaN 

INV: 12-NaN 

INV:14-NaN 

INF 

INV:15-NaN 

INV:15-NaN 

OK: 1-INF 

OK: 1-INF 

INV:13-NaN 

NRM 

OK:0-ZERO 

OK:0-ZERO 

0VF:5-(1) 
OK:2,3-NRM 
UNF:6,7-ZERO 

OK: 1-INF 

INV:13-NaN 

DNRM 

OK:0-ZERO 

OK:-ZERO 

OK:0-ZERO 

INV:15-NaN 

INV:13-NaN 

ZERO 

OK:0-ZERO 

OK:0-ZERO 

OK:0-ZERO 

INV:15-NaN 

INV:13-NaN 


Notes: 


1. OVF will produce INF or MAX.NRM, depending upon the rounding mode: 
+MAX.NRM IF [(RM, RZ) AND (RESULT IS +)] 

-MAX.NRM IF [(RP, RZ) AND (RESULT IS -)] 

+INF IF [(RN, RP) AND (RESULT IS +)] 

-INF IF [(RN, RM) AND (RESULT IS -)] 
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Operations, continued 


TABLE 7: FLOATING POINT ADD/SUBTRACT (IEEE MODE) 

A/B 

ZERO 

DNRM 

NRM 

INF 

NaN 

NaN 

INF:12-NaN 

INV:12-NaN 

INV:12-NaN 

INV:12-NaN 

INV:14-NaN 

INF 

OK: 1-INF 

OK: 1-INF 

OK: 1-INF 

INV:15-NaN (2) 
OK: 1-INF (1) 

INV:13-NaN 

NRM 

OK:2-NRM 

OVF:5-(4) 

OK:2,3-NRM 

UNF:6,7-UNRM 

OVF: 5-(4) 
OK:2,3-NRM 
UNF:6,7-UNRM 
OK:0-ZERO 

OK: 1-INF 

INV:13-NaN 

DNRM 

UNF:6-UNRM 

OK:0-ZERO (3) 
UNF:6-UNRM 
OK:2-NRM 

OK:2,3-NRM 

UNF:6,7-UNRM 

OVF:5-(4) 

OK: 1-INF 

INV:13-NaN 

ZERO 

OK:0-ZERO (3) 

UNF:6-UNRM 

OK-.2-NRM 

OK: 1-INF 

INV:13-NaN 


Notes: 


1. 

+INF+INF 


+INF 




-INF-INF 

—f 

-INF 



2. 

+INF-INF 

—f 

NaN 




-INF+INF 


NaN 



3. 

+ZERO+ZERO 


+ZERO 

(RN, RZ, 

RP, RM) 


-ZERO-ZERO 

— ¥ 

-ZERO 

(RN, RZ, 

RP, RM) 


+ZERO-ZERO 

—f 

+ZERO 

(RN, RZ, 

RP) 


+ZERO-ZERO 


-ZERO 

(RM) 



-ZERO+ZERO 

—► 

+ZERO 

(RN, RZ, 

RP) 


-ZERO+ZERO 

—► 

-ZERO 

(RM) 




4. OVF will produce INF or MAX.NRM, depending upon the rounding mode: 


+MAX.NRM 

-MAX.NRM 

+INF 

-INF 


IF 

IF 

IF 

IF 


[(RM, RZ) 
[(RP, RZ) 
[(RN, RP) 
[(RN, RM) 


AND (RESULT IS +)] 
AND (RESULT IS -)] 
AND (RESULT IS +)] 
AND (RESULT IS -)] 
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WTL 1164/WTL 1165 64-BIT IEEE 
FLOATING POINT MULTIPLIER/ 
DIVIDER AND ALU 


PRELIMINARY DATA 


TABLE 8: FLOATING POINT MULTIPLICATION (IEEE MODE) 

A/B ! 

ZERO 

DNRM 

NRM 

INF 

NaN 

NaN S 

INF:12-NaN 

INV:12-NaN 

INV: 12-NaN 

INV: 12-NaN 

INV:14-NaN 

INF 

INV:15-NaN 

OK: 1-INF 

OK: 1-INF 

OK: 1-INF 

INV:13-NaN 

NRM 

OK:0-ZERO 

DIN:9-(U) * 

OVF: 5-(2) 
OK:2,3-NRM 
UNF:6,7-UNRM 

OK: 1-INF 

INV:13-NaN 

DNRM 

OK:0-ZERO 

DIN:10-(U) * 

DIN:8-(U) * 

OK: 1-INF 

INV:13-NaN 

ZERO 

OK:0-ZERO 

OK: 1-ZERO 

OK: 1-ZERO 

INV:15-NaN 

INV:13-NaN 


Note: 

(1) (U) result is undefined. 


(2) OVF will produce INF or MAX.NRM, depending upon the rounding mode: 


+MAX.NRM IF [(RM, RZ) AND (RESULT IS +)] 

+MAX.NRM IF [(RP, RZ) AND (RESULT IS -)] 

+INF IF [(RN, RP) AND (RESULT IS +)] 

-INF IF [(RN, RM) AND (RESULT IS -)] 


TABLE 9: FLOATING POINT DIVIDE (“FAST” MODE) 

A/B 

ZERO 

DNRM 

NRM 

INF 

NaN 

NaN 

INV: 12-NaN 

INV: 12-NaN 

INV: 12-NaN 

INV: 12-NaN 

INV:14-NaN 

INF 

OK: 1-INF 

OK: 1-INF 

OK: 1-INF 

INV:15-NaN 

INV:13-NaN 

NRM 

DVZ:11-INF 

DVZ:11-INF 

OK:2,3-NRM 
OVF:5-(1) 
UNF:6,7-UNRM 

OK.-O-ZERO 

INV:13-NaN 

DNRM 

INV:15-NaN 

INV:15-NaN 

OK:0-ZERO 

OK:0-ZERO 

INV:13-NaN 

ZERO 

INV:15-NaN 

INV:15-NaN 

OK:0-ZERO 

OK:0-ZERO 

INV:13-NaN 
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Operations, continued 


Note: 


(1) OVF will produce INF or MAX.NRM, depending upon the rounding mode: 


+MAX.NRM 

+MAX.NRM 

+INF 

-INF 


IF [(RM, RZ) AND (RESULT IS +)] 
IF [(RP, RZ) AND (RESULT IS -)] 
IF [(RN, RP) AND (RESULT IS +)] 
IF [(RN, RM) AND (RESULT IS -)] 


TABLE 10: FLOATING POINT DIVIDE (IEEE MODE) 

A/B 

ZERO 

DNRM 

NRM 

INF 

NaN 

NaN 

INV:12-NaN 

INV:12-NaN 

INV:12-NaN 

INV:12-NaN 

INV:14-NaN 

INF 

OK: 1-INF 

BDIN:9-(2) 

INF: 1-INF 

INV:15-NaN 

INV:13-NaN 

NRM 

DVZ:11-INF 

BDIN:9-(2) 

OK:2,3-NRM 

OVF:5-(1) 

UNF:6,7-UNRM 

OK:0-ZERO 

INV:13-NaN 

DNRM 

DVZ:11-INF 

ABDIN:10-(2) 

ADIN:8-(2) 

i 

ADIN:8-(2) 

INV:13-NaN 

ZERO 

INV:15-NaN 

BDIN:9-(2) 

OK:0-ZERO 

OK:0-ZERO 

INV:13-NaN 


Note: 

(1) OVF will produce INF or MAX.NRM, depending upon the rounding mode: 

+MAX.NRM IF [(RM, RZ) AND (RESULT IS +)] 

+MAX.NRM IF [(RP, RZ) AND (RESULT IS -)] 

+INF IF [(RN, RP) AND (RESULT IS +)] 

-INF IF [(RN, RM) AND (RESULT IS -)] 

(2) Result is undefined. 
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Operations, continued 


TABLE 11: FLOATING POINT COMPARE STATUS 

A/B 

NaN 

-INF 

-NRM 

-DNRM 

ZERO 

+DNRM 

+NRM 

+INF 

NaN 

U:15 

U: 15 

U:15 

U:15 

U:15 

U:15 

U:15 

U:15 

+INF 

U:15 

G:2 

G:2 

G:2 

G:2 

G:2 

G:2 

E:0 

+NRM 

U: 15 

G:2 

G:2 

G:2 

G:2 

G:2 

:0, 1, 2 

L:1 

+DNRM 

U: 15 

G:2 

G:2 

G:2 

G:2 

:0, 1, 2 

L: 1 

L:1 

ZERO 

U: 15 

G:2 

G:2 

G:2 

E:0 

L:1 

L:1 

L:1 

-DNRM 

U:15 

G:2 

G:2 

:0, 1.2 

L:1 

L: 1 

L: 1 

L:1 

-NRM 

U:15 

G:2 

:0, 1, 2 

L:1 

L:1 

L:1 

L:1 

L:1 

-INF 

U:.15 

E:0 

L:1 

L:1 

L:1 

L:1 

L:1 

L:1 


FORMAT: Condition: Status Code(s) 

U:15 := unordered (status = 15) 

E:0 := equal (status = 0) 

L:1 := A < B (status = 1) 

G:2 := A > B (status = 2) 

:0, 1,2 := may be A = B, A < B, or A > B, depending upon data values 


TABLE 12: CONVERT SINGLE TO DOUBLE 

F32 -*■ F64 

F32 OPERAND 

F64 RESULT 

STATUS 

COMMENTS 

7FFFFFFF 

or FFFFFFFF 

7FFFFFFF 

FFFFFFFF 

12 

A operand is NaN 

7F800000 

7FF00000 

00000000 

1 

+INF 

7F7FFFFF 

47EFFFFF 

E0000000 

2 

Input operand is 
+MAX.NRM 

3F800000 

3FF00000 

00000000 

2 

+ 1 

00800000 

38100000 

00000000 

2 

Input operand is 

+MIN.NRM 

007FFFFF 

380FFFFF 

cooooooo 

2 

i 

Input operand is 
+MAX.DNRM 

Result = 0 in “FAST” mode 

00000001 

36A00000 

00000000 

2 

Input operand is 
+MIN.DNRM 

Result = 0 in “FAST” mode 

00000000 

00000000 

00000000 

0 

+ZERO 


Note: Sign bit is orthogonal; it is directly copied from the input operand to the output result (except for NaN 

which is clamped to zero). 
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Operations, continued 


TABLE 13: CONVERT DOUBLE TO SINGLE 

F64 —► F32 (Round to Nearest) 

F64 OPERAND 

F32 RESULT 

STATUS 

COMMENTS 

7FFFFFFF 

7FFFFFFF 

12 

A operand is NaN 

FFFFFFFF 




7FFOOOOO 

7F800000 

1 

+INF 

00000000 




7FEFFFFF 

7F800000 

5 

+MAX.NRM 

FFFFFFFF 



OVERFLOWS 

47EFFFFF 

7F800000 

5 

+OVF RESULT 

F0000000 




47EFFFFF 

7F7FFFFF 

2 

+MAX.NRM 

E0000000 



RESULT 

3FF00000 

3F800000 

2 

+ 1 

00000000 




38100000 

00800000 

2 

+MIN.NRM 

00000000 



RESULT 

380FFFFF 

00800000 

3 

Result after rounding 

FFFFFFFF 



is +MIN.NRM 

38000000 

00000000 (UNRM) 1165 

6 

Result 

00000000 



underflows 

36FFFFFF 

77FFFFFF (UNRM) 1165 

7 

Produces 

FFFFFFFF 



UNRM + NXT 

36A00000 

75000000 (UNRM) 1165 

6 

+MIN.DNRM 

00000000 



RESULT 

00000000 

40000000 (UNRM) 1165 

7 

Input is 

00000001 



DNRM 

00000000 

00000000 

0 

ZERO 

00000000 






Note: Sign bit is orthogonal; it is directly copied from the input operand to the output result (except for NaN 

which is clamped to zero). 
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WTL 1164/WTL 1165 64-BIT IEEE 
FLOATING POINT MULTIPLIER/ 
DIVIDER AND ALU 


PRELIMINARY DATA 


Operations, continued 


TABLE 14: DOUBLE FLOAT 

132 -► F64 

132 OPERAND 

F64 RESULT 

STATUS 

COMMENTS 

7FFFFFFF 

41DFFFFF 

FFC00000 

2 

Largest Positive 

Integer 

00000001 

3FF00000 

00000000 

2 

+ 1 

00000000 

00000000 

00000000 

0 

ZERO 

FFFFFFFF 

BFF00000 

00000000 

2 


80000000 

C1E00000 

00000000 

2 

Largest Negative 

Integer 


TABLE 15: SINGLE FLOAT 

132 -*• F32 

132 OPERAND 

F32 RESULT 

STATUS 

COMMENTS 

7FFFFFFF 

4F000000 

3 

Largest Positive Integer 

7FFFFFC0 

4F000000 

3 

INEXACT 

7FFFFF80 

4EFFFFFF 

2 

EXACT 

00000001 

3F800000 

2 

+ 1 

00000000 

00000000 

0 

ZERO 

FFFFFFFF 

BF800000 

2 

~1 

80000080 

CEFFFFFF 

2 

EXACT 

80000040 

CF000000 

3 

INEXACT 

80000000 

CF000000 

2 

Largest Negative Integer 
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Operations, continued 


TABLE 16: DOUBLE FIX 

F64 —► 132 (Round to Nearest) 

F64 OPERAND 

132 RESULT 

STATUS 

COMMENTS 

7FFFFFFF 

FFFFFFFF 

7FFFFFFF 

12 

Input is NaN 

7FF00000 

00000000 

7FFFFFFF 

5 

+INF 

7FEFFFFF 

FFFFFFFF 

7FFFFFFF 

5 

Input is 
+MAX.NRM 

41DFFFFF 

FFC00000 

7FFFFFFF 

2 

Largest Positive 

Inteaer Result 

3FF00000 

00000000 

00000001 

2 

+1 

3FE80000 

00000000 

00000001 

3 

INEXACT 

00100000 

00000000 

00000000 

3 

Input is 

MIN.NRM 

00000000 

00000001 

00000000 

3 

Input is 

MIN.DNRM 

00000000 

00000000 

00000000 

0 

+ZERO 

80000000 

00000000 

00000000 

0 

-ZERO 

8FF00000 

00000000 

00000000 

3 

Small Negative 

Number 

BFF00000 

00000000 

FFFFFFFF 

2 

-1 

C1E00000 

00000000 

80000000 

5 

Largest Negative 

Inteaer Result 

FFF00000 

00000000 

80000000 

5 

-INF 

FFFFFFFF 

FFFFFFFF 

7FFFFFFF 

12 

-NaN 



WTL 1164/WTL 1165 64-BIT IEEE 
FLOATING POINT MULTIPLIER/ 
DIVIDER AND ALU 


PRELIMINARY DATA 


Operations, continued 


TABLE 17: SINGLE FIX 

F32 -*■ 132 

F32 OPERAND 

132 RESULT j 

STATUS 

COMMENTS 

7FFFFFFF 

7FFFFFFF 

12 

Input is NaN 

7F800000 

7FFFFFFF 

5 

+INF 

7F7FFFFF 

7FFFFFFF 

5 

Input is +MAX.NRM 

4F000000 

7FFFFFFF 

5 

+OVF 

4EFFFFFF 

7FFFFF80 

2 

EXACT 

3F800000 

00000001 

2 

+1 

3F400000 

00000001 

3 

INEXACT 

00800000 

00000000 

3 

Input is +MIN.NRM 

00000001 

00000000 

3 

Input is +MIN.DNRM 

00000000 

00000000 

0 

+ZERO 

80000000 

00000000 

0 

-ZERO 

8F800000 

00000000 

3 

Small Negative Number 

BF800000 

FFFFFFFF 

2 

-1 

CEFFFFFF 

80000080 

2 

Large Negative Number 

CF000001 

80000000 

5 

-OVF 

FF800000 

80000000 

5 

-INF 



TABLE 18: DOUBLE WRAP DENORMALIZED VALUE 

F64 -»• W64 

F64 OPERAND 

W64 RESULT 

STATUS 

COMMENTS 

00000000 

7CD00000 

6 

Input is +MIN.DNRM 

00000001 

00000000 



00080000 

00000000 

6 

Always Exact 

00000000 

00000000 



000FFFFF 

000FFFFF 

6 

Input is +MAX.DNRM 

FFFFFFFF 

FFFFFFFE 




TABLE 19: SINGLE WRAP DENORMALIZED VALUE 

F32 -*■ W32 

F32 OPERAND 

W32 RESULT 

STATUS 

COMMENTS 

00000001 

75000000 

6 

Input is -MIN.DNRM 

00400000 

00000000 

6 

Always Exact 

007FFFFF 

007FFFFE 

6 

Input is -MAX.DNRM 


c 
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Operations, continued 


TABLE 20: DOUBLE UNWRAP EXACT VALUE 

U64 -f D64 

U64 OPERAND 

D64 RESULT 

STATUS 

COMMENTS 

000FFFFF 

00100000 

3 

Result is 

FFFFFFFF 

00000000 


NRM + NXT 

00000000 

00080000 

6 

UNF + EXACT 

00000000 

00000000 



7FFFFFFF 

00080000 

7 

UNF + NXT 

FFFFFFFF 

00000000 



40000000 

00000000 

7 

UNF + NXT 

00000000 

00000000 




TABLE 21: SINGLE UNWRAP EXACT VALUE 

U32 —*■ D32 

U32 OPERAND 

D32 RESULT 

STATUS 

COMMENTS 

007FFFFF 

00800000 

3 

Result is NRM 

007FFFFE 

007FFFFF 

6 

Result is DNRM 

00000000 

00400000 

6 

UNF + EXACT 

7FFFFFFF 

00400000 

7 

UNF + NXT 

40000000 

00000000 

7 

UNF + NXT 



Note: For single and double unwrap functions, the sign bit of the output result is directly copied from the 

sign bit of the input operand. 
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WTL 1164/WTL 1165 64-BIT IEEE 
FLOATING POINT MULTIPLIER/ 
DIVIDER AND ALU 


PRELIMINARY DATA 


Pin Configuration 

The WTL 1164/WTL 1165 will be supplied in 64-pin allocated as shown. The four additional pins on the 

DIP and 68-pin pin grid array packages. Pins are 68-pin packages are not connected. 


© JC = 6° C/Watt 

© JA = 13° C/Watt - 
200 CFM, with heat 
sink 

0 JA = 20° C/Watt - 
200 CFM, no heat 
sink 

* New PGA footprint; 
use for systems 
designed after 
July 11, 1986. 

This part is desginated 
with the GCB suffix in 
the ordering informa¬ 
tion. 




© 

© 

© 

® 

© 

© 

© 

© 




TOP VIEW 

DIE CAVITY AND LID 
ON UNDERSIDE 
OF PACKAGE 


©©©©©©©© 



D O0©0©©©©© 


PIN # 

PGA 

DIP 

PIN # 

PGA 

DIP 

PIN # 

PGA 

DIP 

PIN# 

PGA 

DIP 

mmMm 

OE 

X17 

18 

X25 

X2 

35 

VCC 

LI 

52 

GND 

X28 


csus- 

X16 

19 

X24 

XI 

36 

GND 

L2 

lijisiijiii! 

CLK 

X27 

iiiiiiiiigll 

VDD 

X15 

20 

X23 

XO 

37 

X9 

L3 

mmMm 

F0 

X26 

iiiihiii4iiiiiiiii 

GND 

X14 

ii!i2Hiiiiiiii 

X22 

GND 

38 

X8 

VDD 

mmsm 

FI 

GND 

WM i|| 

S3 

X13 

§MMM 

X21 

VDD 

39 

X7 

GND 

MMem: 

F2 

VDD 

6 

S2 

X12 

23 

X20 

CSUX- 

40 

X6 

VCC 

wmm 

F3 

X25 

W'MMM 

SI 

XII 

mm* 

X19 

U 

41 

X5 

OE- 

mMm 

F4 

X24 

M'Mm 

SO 

X10 

25 

X18 

GND 

42 

X4 

CSUS- 

59 

F5 

X23 

mMm 

X31 

VCC 

26 

X17 

CLK 

. 43 

X3 

VDD 


CSL- 

X22 

iiiisiiiDiliii 

X30 

GND 

mmm 

X16 

F0 

iiiibiiiiiiii 

X2 

GND 

iiiiiiiiBiiiiiiiiii 

L0 

X21 

ii:: 

X29 

X9 

iliiaaii! 

X15 

FI 

l:::45:;;i; 

XI 

S3 

62 

LI 

X20 

^ : ^l2;i;: 

X28 

X8 

29 

X14 

F2 

46 

XO 

S2 

63 

L2 

X19 

^ -TiSiii 

X27 

X7 

30 

X13 

F3 

47 

GND 

SI 

64 

L3 

X18 


X26 

X6 

WMm 

X12 

F4 

'MMM 

VDD 

SO 

65 

.1 

VDD 


itiiii! 

GND 

X5 


XII 

F5 

Mm&MM 

CSUX 

X31 

: 66 

GND 



VCC 

X4 

33 ! 

X10 

CSL- 


NC 

X30 


VCC 



N/C 

X3 

34 ;i!i 

NC 

L0 

lilll 

U 

X29 

WM : mi 

NC 
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Physical Dimensions 


64-PIN DUAL IN-LINE PACKAGE 



Symbol 

LIMITS 

INCHES MM 

MIN 

MAX 

MIN 

MAX 

A*** 

.135 

.250 

3.43 

11.43 

b 

.015 

.022 

0.38 

0.56 

bl 

.030 

.060 

0.76 

1.52 

c 

.008 

.013 

0.20 

0.33 

D 

3.140 

3.260 

79.76 

82.80 

E 

.775 

.825 

19.69 

20.96 

El 

.880 

.920 

22.35 

23.37 

e 

.090 

.110 

2.29 

2.79 

L 

.120 

.160 

3.05 

4.06 

Q 

.040 

.100 

1.02 

2.54 

a 

0 ° 

15° 

0 ° 

15° 

s* 


.098 


2.49 

si** 

.005 


1.27 



* From centerline of end lead 
* * From edge of end lead 
*** Includes heat sink 


68 -PIN PIN GRID ARRAY PACKAGE 



Symbol 

Measurements 

Inches 

Millimeters 

Ai 

.157 ± 0.017 

4.04 + .43 

a 2 

0.160 1 0.008 

4.06 t .20 

A 3 

0.050 + 0.006 


D 

■nsnsEisn 

27.94 t .30 

E 

1.000 + 0.012 

25.4 + .30 

Ei 

0.800 + 0.012 

20.32 i .30 

e 2 

0.018 dia. typ. 

.46 dia. typ. 

e 3 

0.050 dia. typ 

1.27 dia. typ 



o 
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WTL 1164/WTL 1165 64-BIT IEEE 
FLOATING POINT MULTIPLIER/ 
DIVIDER AND ALU 


PRELIMINARY DATA 


Ordering Information 


PACKAGE TYPE 

TEMPERATURE RANGE 

(Tcase) 

ORDER NUMBER 

Hermetic DIP 

T = 0 to +80° C 

WTL 1164-080-JC/WTL1165-080-JC 

Pin Grid Array 

T = 0 to +80° C 

WTL 1164-080-GC/WTL1165-080-GC 

Hermetic DIP 

T = 0 to +80° C 

WTL 1164-060-JC/WTL1165-060-JC 

Pin Grid Array 

T = 0 to +80° C 

WTL 1164-060-GC/WTL1165-060-GC 
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Revision Summary 


1 . 

Features 

Revised 

page 1 

2. 

Description 

Revised 

pages 1-2 

3. 

Pin Definitions 

Revised 

page 2 

4. 

Recommended Operating Conditions 

Revised 

page 4 

5. 

DC Electrical Characteristics 

Revised 

page 4 

6. 

AC Switching Characteristics 

Revised 

page 5 

7. 

I/O Characteristics 

Revised 

page 6 

8. 

Load Controls 

Revised 

page 7 

9. 

Load Notes 

New 

page 7 

10. 

Function Controls for Floating Point ALU 

Revised 

page 10 

11. 

Table 4. Function Controls for Floating 

Point ALU 

Revised 

page 13 

12. 

Result Status 

Revised 

page 15 

13. 

Figure 7. Single Precision Multiply 

Revised 

page 16 

14. 

IEEE Compatability 

Revised 

pages 27-28 

15. 

Gradual Underflow 

Revised 

pages 31-32 

16. 

Operations (Tables 5-21) 

Revised 

pages 32-41 

17. 

Pin Configuration 

Revised 

page 42 
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For additional information on WEITEK products, please fill out the form below and mail. 


Name Title 



Company 

Phone 


Address 



Comments 

I am currently involved in a design with the following Weitek products 
design data base to insure that I receive status updates. 


and wish to be added to vour 


APPLICATION: 


□ ENGINEERING WORKSTATIONS 

□ GRAPHICS 

□ PERSONAL COMPUTERS 


□ SCIENTIFIC COMPUTERS 

□ OTHER _ 


Check the products on which you wish to receive data sheets: □ Have a sales person call 

ATTACHED PROCESSORS COPROCESSORS BUILDING BLOCKS 


□ XL-SERIES OVERVIEW 

□ XL-8200 OVERVIEW 

□ 1167 

□ 1164/1165 

□ 3164/3364 

□ 3167 

□ 2264/2265 

□ 3132/3332 

□ 1232/1233 

□ 1066 
□ 2010 
□ 2245 

□ 2516 

□ 2517 


WEITEK use: Rec’d 


Out 

TPT 


Source: DS 


Status 


WEITEK WTL1164/WTL1165 

Please Comment On The Quality Of This Data Sheet. 

Even though we have tried to make this data sheet as complete as possible, it is conceivable that we have 
missed something that may be important to you. If you believe this is the case, please describe what the 
missing information is, and we will consider including it in the next printing of the data sheet. 


Fold, Staple and Mail to Weitek Corp. 


NO POSTAGE 
NECESSARY 
IF MAILED 
IN THE 

UNITED STATES 


BUSINESS REPLY MAIL 

FIRSTCLASS PERMIT NO. 1374 SUNNYVALE.CA 


POSTAGE WILL BE PAID BY ADDRESSEE 

WEITEK Corporation 
1060 E. Arques Ave. 

Sunnyvale, CA 94086-BRM-9759 


ATTN: Ed Masuda 







( 



WEITEK’S CUSTOMER COMMITMENT: 


WEITEK 

A 


A 


Weitek’s mission is simple: to provide you with VLSI solutions 
to solve your compute-intensive problems. We translate that 
mission into the following corporate objectives: 

1. To be first to market with performance breakthroughs, allow¬ 
ing you to develop and market systems at the edge of your art. 

2 . To understand your product, technology, and market needs, so 
that we can develop Weitek products and corporate plans that 
will help you succeed. 

3. To price our products based on the fair value they represent to 
you, our customers. 

4 . To invest far in excess of the industry average in Research and 
Development, giving you the latest products through techno¬ 
logical innovation. 

5 . To invest far in excess of the industry average in Selling, Mar¬ 
keting, and Technical Applications Support, in order to pro¬ 
vide you with service and support unmatched in the industry. 

6 . To serve as a reliable, resourceful, and quality business part¬ 
ner to our customers. 

These are our objectives. We’re committed to making them 
happen. If you have comments or suggestions on how we can 
do more for you, please don’t hesitate to contact us. 



President 


A 





Headquarters 

Weitek Corporation 
1060 E. Arques Avenue 
Sunnyvale, CA 94086 
TWX 910-339-9545 
WEITEK SVL 
FAX (408) 738-1185 
TEL (408) 738-8400 


Domestic Sales Offices 
Weitek Corporation 
1060 E. Arques Avenue 
Sunnyvale, CA 94086 
TWX 910-339-9545 
WEITEK SVL 
FAX (408) 738-1185 
TEL (408) 738-8400 


Corporate Place IV 
111 South Bedford St. 
Suite 200 

Burlington, MA 01803 
FAX (617) 229-4902 
TEL (617) 229-8080 


European Sales Headquarters Japanese Representative j —, 

Greyhound House, 23/24 George St. C. Itoh Techno/Sciences « 
Richmond, Surrey, TW91JY Company Ltd. 

England C. Itoh Building 

TELEX 928940 RICHBIG 2-5-1 Kita-Aoyama 

FAX 011-441 940 6208 Minato-Ku, Tokyo 107 

TEL 011-441 549 0164 TELEX 781242 3240 

FAX (81) 3-497-4879 
TEL (81) 3-497-4975 




